Towards a standard sampling methodology on online social networks: collecting global trends on Twitter
نویسندگان
چکیده
One of the most significant current challenges in large-scale online social networks, is to establish a concise and coherent method able to collect and summarize data. Sampling the content of an Online Social Network (OSN) plays an important role as a knowledge discovery tool. It is becoming increasingly difficult to ignore the fact that current sampling methods must cope with a lack of a full sampling frame i.e., there is an imposed condition determined by a limited data access. In addition, another key aspect to take into account is the huge amount of data generated by users of social networking services. This type of conditions make especially difficult to develop sampling methods to collect truly reliable data. Therefore, we propose a low computational cost method for sampling emerging global trends on social networking services such as Twitter. The main purpose of this study, is to develop a methodology able to carry out an efficient collecting process via three random generators: Brownian, Illusion and Reservoir. These random generators will be combined with a Metropolis-Hastings Random Walk (MHRW) in order to improve the sampling process. We demonstrate the effectiveness of our approach by correctly providing a descriptive statistics of the collected data. In addition, we also sketch the collecting procedure on real-time carried out on Twitter. Finally, we conclude with a trend concentration graphical description and a formal convergence analysis to evaluate whether the sample of draws has attained an equilibrium state to get a rough estimate of the sample quality.
منابع مشابه
What Trends in Chinese Social Media
There has been a tremendous rise in the growth of online social networks all over the world in recent times. While some networks like Twitter and Facebook have been well documented, the popular Chinese microblogging social network Sina Weibo has not been studied. In this work, we examine the key topics that trend on Sina Weibo and contrast them with our observations on Twitter. We find that the...
متن کاملA CSA Method for Assigning Client to Servers in Online Social Networks
This paper deals with the problem of user-server assignment in online social network systems. Online social network applications such as Facebook, Twitter, or Instagram are built on an infrastructure of servers that enables them to communicate with each other. A key factor that determines the facility of communication between the users and the servers is the Expected Transmission Time (ETT). A ...
متن کاملOf Pins and Tweets: Investigating How Users Behave Across Image- and Text-Based Social Networks
Today, it is the norm for online social (OSN) users to have accounts on multiple services. For example, a recent study showed that 34% of all Twitter users also use Pinterest. This situation leads to interesting questions such as: Are the activities that users perform on each site disjoint? Alternatively, if users perform the same actions on multiple sites, where does the information originate?...
متن کاملDesign and Test of the Real-time Text mining dashboard for Twitter
One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in dataset that is currently being produced at high speed, large volumes and with a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing the huge amount of data, today has become one of the concerns of data science scholar...
متن کاملDynamics of Trends and Attention in Chinese Social Media
There has been a tremendous rise in the growth of online social networks all over the world in recent years. It has facilitated users to generate a large amount of real-time content at an incessant rate, all competing with each other to attract enough attention and become popular trends. While Western online social networks such as Twitter have been well studied, the popular Chinese microbloggi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Applied Network Science
دوره 1 شماره
صفحات -
تاریخ انتشار 2016